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Many ribosomes simultaneously move on the same messenger RNA (mRNA), each separately 
synthesizing the protein coded by the mRNA. Earlier models of ribosome traffic represent each 

£N| ■ ribosome by a "self-propelled particle" and capture the dynamics by an extension of the totally 

asymmetric simple exclusion process (TASEP). In contrast, here we develope a theoretical model 
that not only incorporates the mutual exclusions of the interacting ribosomes, but also describes 

^* . explicitly the mechano-chemistry of each of these individual cyclic machines during protein synthesis. 

Using analytical and numerical techniques of non-equilibrium statistical mechanics, we analyze this 
model and illustrate its power by making experimentally testable predictions on the rate of protein 
synthesis in real time and the density profile of the ribosomes on some mRNAs in E-Coli. 
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Translation, the process of synthesis of proteins by de- 
coding genetic information stored in the mRNA, is car- 
ried out by ribosomes. Understanding the physical prin- 
ciples underlying the mechanism of operation of this com- 
plex macromolecular machine [l| will not only provide in- 
sight into the regulation and control of protein synthesis, 
but may also find therapeutic applications as ribosome is 
the target of many antibiotics [2] ■ 

Most often many ribosomes move simultaneously on 
the same mRNA strand while each synthesises a protein. 
In all the earlier models of collective traffic-like move- 
ments of ribosomes 0& LI LI St* 111 > the entire ribosome 
is modelled as a single "self-propelled particle" ignoring 
its molecular composition and architecture. Moreover, in 
these models the inter-ribosome interactions are captured 
through hard-core mutual exclusion and the dynamics of 
the system is formulated in terms of rules that are essen- 
tially straightforward extensions of the TASEP [jjj] . 

In reality, the mechanical movement of each ribo- 
some is coupled to its biochemical cycle. The earlier 
TASEP-like models cannot account for those aspects of 
spatio-temporal organization that depend on the detailed 
mechano-chemical cycle of each ribosome. In this letter 
we develope a "unified" model that not only incorporates 
the hard-core mutual exclusion of the interacting ribo- 
somes, but also captures explicitly the essential steps in 
the biochemical cycle of each ribosome, including GTP 
(guanine triphosphate) hydrolysis, and couples it to its 
mechanical movement during protein synthesis. Conse- 
quently, in the low-density limit, our model accounts for 
the protein synthesis by a single isolated ribosome while 
at higher densities the same model predicts not only the 
rate of protein synthesis but also the collective density 
profile of the ribosomes on the mRNA strand. 

We represent the mRNA chain, consisting of L codons, 
by a one-dimensional lattice of length L+l — 1 where each 
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of the first L sites from the left represents a single codon 
(i.e., a triplet of nucleotides). We label the sites of the 
lattice by the integer i; the sites i = 1 and i — L represent 
the start codon and the stop codon, respectively. 

The smaller sub-unit of the ribosome, which is known 
to bind with the mRNA, is represented by an extended 
particle of length £ (in the units of the size of a codon) , as 
shown in fig^a) {£ = 12 for all results reported here). 0, 
IM B H] , Thus, the smaller subunit of each ribosome 
covers £ codons at a time (see figQI a ))- According to 
our convention, the position of such a ribosome on the 
mRNA strand will be given by the position of the lattice 
site covered by the left edge of its smaller subunit. Each 
ribosome moves forward by only one site in each step as it 
must translate successive codons one by one. The mutual 
interactions of the ribosomes translocating on the same 
mRNA is taken into account by imposing the constraint 
of mutual exclusion. 

The process of translation itself can be divided into 
three main stages: (a) initiation, (b) elongation, and (c) 
termination. Since our model is not intended to describe 
initiation and termination in detail, we represent initia- 
tion and termination by the two parameters a and (3, re- 
spectively (see figCJa)). If the first £ sites on the mRNA 
are vacant, this group of sites is allowed to be covered by 
a ribosome, from the pool of unbound ribosomes, with 
probability a in the time interval At (in all our numeri- 
cal calculations we take At = 0.001 s). Similarly, if the 
rightmost £ sites of the mRNA lattice are covered by a 
ribosome, i.e., the ribosome is bound to the L-th codon, 
the ribosome gets detached from the mRNA with prob- 
ability p in the time interval At. Moreover, since a is 
the probability of attachment in time At, the probability 
of attachment per unit time (which we call u> a ) is the 
solution of the equation a = 1 — e - w « xA *. 

To our knowledge, all the earlier models of ribosome 
traffic on mRNA ]1 H H H & 0, describe elonga- 
tion also by a single parameter, namely, the rate q of 
hopping of a ribosome from one codon to the next. In 
contrast, we model the mechano-chemistry of elongation 
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FIG. 1: (a) The mRNA is represented by a one-dimensional 
lattice each site of which corresponds to a distinct codon; the 
smaller subunit of the ribosome (represented schematically 
by the rectangle) can cover simultaneously £ codons (I — 2 in 
this figure) while the letters E, P, A denote the three binding 
sites on its larger subunit. The parameters a and /3 capture 
the effective rates of initiation and termination of transla- 
tion, (b) The biochemical cycle of a single ribosome during 
the elongation stage. Each box represents a distinct state 
of the ribosome. The index below the box labels the codon 
on the mRNA with which the smaller subunit of the ribosome 
binds. The number above the box labels the biochemical state 
of the ribosome. Within each box, 1(0) represents presence 
(absence) of tRNA on binding sites E, P, A. 1* is a EF-Tu 
bound tRNA and G is a EF-G GTPase. The symbols ac- 
companied by the arrows define the rate constants for the 
corresponding transitions. The dashed arrow represents the 
approximate pathway we have considered in our model. 



the ribosome moves forward by one codon (i.e., one step 
on the lattice). 

However, in setting up the rate equations below, we 
treat the entire transition 5 — > 6 — > 7 — > 1 as, effectively, 
a single step transition from 5 to 1, with rate constant 
LJh2- Thus, throughout this paper we work with a sim- 
plified model where each biochemical cycle during the 
elongation process consists of five distinct states. 

The modelling strategy adopted here for incorporating 
the biochemical cycle of ribosomes is similar to that fol- 
lowed in the recent work [11] on single-headed kinesin mo- 
tors KIF1A. However, the implementation of the strat- 
egy is more difficult here not only because of the higher 
complexity of composition, architecture and mechano- 
chemical processes of the ribosomal machinery and but 
also because of the heterogeneity of the mRNA track [l^] • 

Let Pfi{i) be the probability of finding a ribosome at 
site i, in the chemical state /x. Then, P(i) = 5TJ w =i -^W) 
is the probability of finding a ribosome at site i, irre- 
spective of its chemical state. Moreover, if a site is not 
covered by any part of any ribosome, we'll say that the 
site is occupied by a "hole" . Furthermore, by the symbol 
Q(i\j) we denote the conditional probability that, given 
a ribosome at site i, there is a hole at the site j. The 
master equations for the probabilities P^(i) are given by 



dPiji) 
dt 



■ uj h2 P 5 (i-l)Q(i-l\i-l + £) 
+oj p P 2 (i) -u) a P\{i) 



(1) 



dP 2 (i) 
dt 



u a P 1 (i)-(cu p +Lu hl )P 2 (i) (2) 



dP 3 (i) 
dt 



= u h iP 2 (i) - k 2 P 3 (i) 



(3) 



in detail (figdb)). In state 1, the ribosome begins with 
a tRNA bound to the site P. Binding of a fresh tRNA- 
EF-Tu complex to site A causes the transition 1 — ► 2. 
The EF-Tu has a GTP part which is then hydrolized to 
GDP, driving the transition 2 — > 3. Next, the phosphate 
group, a product of the hydrolysis, leaves resulting in 
the state 4. This hydrolysis, finally, releases the EF-Tu, 
and the peptide bond formation becomes possible. Af- 
ter this step, the tRNAs shift from site P to E and from 
site A to P; the site A becomes occupied by EF-G, in 
the GTP bound form, resulting in the state 5. Hydrol- 
ysis of the GTP to GDP and the release of EF-G drives 
the transition 5^6. The transition 6 — > 7 is accom- 
panied by conformatinal changes that are responsible for 
pulling the mRNA-binding smaller subunit by one step 
forward. Finally, the tRNA on site A is released, result- 
ing in completion of one biochemical cycle; in the process 



dP 4 (i) 
dt 



k 2 P 3 {l) - LJgP 4 (i) 



dP 5 (i) 



(4) 



(// Lu g P 4 (i) - Lu h2 P 5 (i)Q(i\i + £) (5) 

(i + L) 

However, the equations for P\(l) and Pb{L) have the 
special forms 



dt 



= w a (l-^P(s))+w p P 2 (l)-w a Pi(l) (6) 



s=l 



dP 5 {L) 
dt 



= u g P 4 (L)-pP 5 {L). 



(7) 







a=0.0001 


. 




0.0003 




Maximal Current 


3 

L 


0.0002 




High Density 




0.0001 




\ 


- 




' 



where 



0.02 0.04 0.06 0.08 0.1 



(a) 



0.08 


co h =10s" 1 


Low Density 


- 


0.06 






- 






" 


0.04 






- 


0.02 




. ^J. 


- 



0.0002 



0.0004 



0.0006 



0.0008 



(b) 



FIG. 2: Phase diagram in (a) P bJh — 
plane. co g = 25 s" 1 , lo p = 0.0028 s" 



, a plane and (b) a — P^ 
k 2 = 2.4 s _1 , f3 = l. 



Because of the finite length of the codon sequence be- 
tween the start and stop codons, the open boundary con- 
ditions (OBC) are more realistic than the periodic bound- 
ary conditions (PBC). However, we begin with a calcu- 
lation of the flux of the ribosomes in the steady-state by 
imposing PBC as the results for this artificial situation 
are required for a derivation of the dynamical phase dia- 
gram of the system under OBC. Under PBC, P^(i) for all 
i are governed by the equations ft ) -(S ) . Moreover, under 
the PBC, only four of the five equations ©"© are inde- 
pendent because P(i) = Ea=i^(') = N/L = p where 
p, the number density of the ribosomes, is a constant in- 
dependent of time; therefore, we do not need to consider 
equation QJ for P\{i) explicitly. In the steady state, all 
time derivatives vanish and because of the translational 
invariance of this state under PBC, the index i can be 
dropped. It is straightforward to show [13| that, for PBC, 



Q(i\i + £) 



L- N£ 



L + N - Ni - 1 ' 



(8) 



Therefore, under the PBC, equations i12l.")l) can be solved, 
using |J5J, to obtain 



P 5 = 



P 



UJ h2 (L-N£) r 1 - 

L+N-N£-l\-kef f - 



1 + 



(9) 
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The flux of ribosomes J, under PBC, obtained from 

J = uj h2 P 5 Q(i\i + £), is 



J = 



^2/3(1 - pi) 



(1 



n h2 (i-p£) 



(ii) 



where Qh2 = ^>h2/k e ff. The rate of protein synthesis by 
a single ribosome is £J. This mean-field estimate is a 
reasonably good approximation to the data obtained by 
direct computer simulations |l3 |. 
It can be shown [I3 that, for OBC, 



Q(i\i + i) = 



l-Ef=i^ + s) 



1 - E;=i p(* + s) + p(i + 1) 

and the corresponding flux can be obtained from 

I 

J = u a {l-^P S ) 



(12) 



(13) 



Motivated by the recent measurements [Tj, [13 of the 

number of bound ribosomes on the mRNA, we have com- 
puted the detailed density profiles of the ribosomes and 
also drawn the phase diagrams in the spirit of the simi- 
lar plots of non-equilibrium dynamical phases of totally 
asymmetric simple exclusion process )1C| . 

The probabilities a and (3 of initiation and termination 
are incorporated into the model by connecting the ends 
of the mRNA strand to two hypothetical reseroirs with 
appropriate denities p and p+, respectively [5]. The 
extremum principle |l(l ll(il| then relates the flux j in the 
open system to the flux J(p) for a closed periodic system 
with the same dynamics: 



3 



max J(p) if p- > p > p.) 
rain J{p) if p- < p < p^ 



For systems with a single maximum in the function J(p), 
at p = ps,,, such as equation l|ll|) . the maximal current 
phase sets in when p_ > p* > P+- By differentiating 
equation l|ll|) . we find [I3 



-^(i + n h2 ) + ^(i + n h2 ) 



p* 

i[i-t-n h2 t) 

It can also be shown that [I3 

a(l-^)(l + fi M ) 



(14) 



P- 



P Uh -a(l + n h2 )(l-i) 



(15) 



where P Uh is the probability of hydrolysis in the time At, 
and that p + = 0. Similarly, the probability of attachment 
of a aa — tRNA in time At is denoted by P UJa . Thus, the 
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FIG. 3: Flux of ribosomes plotted against (a) w/, and (b) 
a for the genes err (170 codons) and cysK (324 codons) of 
Escherichia coli K-12 strain MG1655, as well as the cor- 
responding curve for a homogenous mRNA strand of 300 
codons. The insets show the average density profiles on a 
hypothetical homogeneous mRNA track for four different val- 
ues of (a) uJh and (b) a, for fixed u> a — 25 s~ . 



phase boundaries between the various phases have been 



obtained by solving the equation 

p-(a,U> a ,Ulhl,Ulh2) = p*(a,U> a ,Ulhl,Ulh2) (16) 

numerically, and two typical phase dia gram s have been 
plotted in figsGIa) and (b) assuming [I3,IlSl u hi — ^hi — 

OJh- 

We focus on genes of Escherichia coli K-12 strain 
MG1655 [13 • We directly simulate the system by assum- 
ing that the site dependent transition rate uo a is propor- 
tional to the percentage availability of the corresponding 
aa-tRNA for that codon, in the E Coli cell [U,l2l|]. In 
figure 0, we see how the current increases as u>h (in 
(a)) and a (in (b)) increases and gradually saturates; 
the saturation value of the current is numerically equal 
to the maximum current obtained in the corresponding 
case with PBC [l3|. Simultaneously, the average density 
of the ribosomes decreases in (a) (and increases in (b)) 
as the parameter u>h in (a) (and a in (b)) increases, and 
gradually satuarates. These observations are consistent 
with the scenario of phase transition from one dynamical 
phase to another, as predicted by the extremal current 
hypothesis. Moreover, the lower flux observed for real 
genes, as compared to that for homogeneous mRNA, is 
caused by the codon specificity of the available tRNA 
molecules. 

In this letter we have developed a "unified" theoreti- 
cal model for protein synthesis by mutually interacting 
ribosomes following the master equation approach of non- 
equilibrium statistical mechanics. We have computed (i) 
the rate of protein synthesis in real time and (ii) density 
profile of the ribosomes on a given mRNA, and stud- 
ied their dependences on the rates of various mechano- 
chemical processes in each ribosome. We have illustrated 
the use of our model by applying these to two genes of 
E-Coli and making theoretical predictions in real time 
which, we hope, will motivate new quantitative measure- 
ments. 
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